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Abstract 



We study a gas containing two components, a small component 
P and a large component Q. P is selectively heated to a high tem- 
perature and then returns to equilibrium via collisions with Q. This 
thermal equilibration process is analysed in a new way. We divide 
the kinetic energy space of the molecules of P into two regions F and 
D, and show that the molecules of P randomly switch ('oscillate') 
between the two states as time proceeds due to collisions with the 
molecules of Q. Initially, the molecules of P are all in the state D, 
however because each molecule in P collides with the molecules of Q 
at different times, the oscillations occur out of step with each other. 
There is a net destructive interference between the oscillations, and so 
they are not observed when monitoring the average kinetic energe of 
the molecules of P as a function of time. We will explain the similari- 
ties and differences between this observation and transverse relaxation 
processes that occur in magnetic resonance spectroscopy. This study 
employs a stochastic model of elastic collisions between the molecules 
of P and Q, and for completeness we examine its relationship with 
the two major models of thermal equilibration in statistical physics, 
namely the Boltzmann equation and the Ornstein-Uhlenbeck process. 
PACS numbers: 05.40.-a, 51.10+y 

1 Introduction 

Consider a gas with two components P and Q, and suppose that Q has 
contains considerably more molecules than P. If P is selectively heated to 
a high temperature, then collisions between the molecules of P and Q will 
return P to its equilibrium state. We will call this process thermal equili- 
bration. The starting point for studying thermal equilibration theoretically 
is the Boltzmann equation. This has been the the subject of considerable 
research for many years and has produced a vast literature. See [1] and [2] 
for recent reviews on the topic and a large list of references. Research over 
the last decade has focused mainly on the rate of approach of the Boltzmann 
equation to equilibrium, and various bounds have been deduced for spatially 
homogeneous [3, 4] and inhomogeneous [5, 6, 7] systems. An alternative 
approach is provided by stochastic models of Brownian motion, usually the 
Ornstein-Uhlenbeck process and other diffusion processes. Stochastic models 
of the Brownian motion have also spawned an enormous literature (see, for 
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example, [8] and [9]). The advantage of stochastic models of the Brownian 
motion over the Boltzmann equation in modelling thermal equilibration is 
that they can be solved relatively easily. However, this is matched by an im- 
portant disadvantage, namely that they provide relative little information on 
the microscopic collision dynamics. Recent studies on this topic have there- 
fore looked at the connection between diffusion processes and more detailed 
collision models in certain limits. These include Lorentzian gas models [10, 
11], heat bath models [12], and models of systems interacting with sequences 
of classical [13] and quantum systems [14, 15]. Through studies involving the 
Boltzmann equation and stochastic models, a relatively detailed picture of 
ther thermal equilibration process is beginning to emerge. 

This paper will provide a further insight into the thermal equilibration 
process. We will show how thermal equilibration can be interpreted in a sim- 
ilar way to an apparently very different non-equilibrium process. Namely, 
transverse relaxation in a magnetic resonance experiment (also known as T 2 
relaxation and spin-spin relaxation). Figure 1 illustrates transverse relax- 
ation for a group of spins embedded in a crystal. Omitting several details, 
the magnetic resonance experiment involves aligining the spins vectors in the 
xy-p\ane with an electromagnetic pulse at time 0. Following this, the spin 
vectors rotate in the xy-plane together and in near unison. However, random 
interactions with the environment surrounding each spin causes the angular 
frequencies of the spins to fluctuate with time. This causes the spin vectors 
to fall out of alignment with each other. If we plot the x or y component 
of the spins as a function of time, we therefore see a series of oscillations 
that gradually fall out of step with each other (Figure 2) [16, 17, 18, 19, 20, 
21, 22]. Transverse relaxation can monitored in the laboratory by observing 
the decay of the net electrical current induced by the rotating spins with 
time. This decay can be thought of as arising from destructive interference 
between the oscillations associated with each spin, like those shown in Fig- 
ure 2. The analogy between thermal equilibration and transverse relaxation 
is constructed as follows. The kinetic energy space of the molecules in P 
is divided into a 'low energy' region and a 'high energy region'. Initially, 
the molecules in P all start in the same region of the kinetic energy space 
(Figure 3). However, as time proceeds the molecules of P switch ('oscillate') 
between the two states due to collisions with the molecules of Q (see Figure 
4). Because each molecule in P collides with molecules of Q at different 
times, the oscillation associated with each molecule falls out of phase with 
one another. There is a net destructive interference between the oscillations, 
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and therefore the individual oscillations are not seen when the thermal equi- 
libration process is monitored through the average kinetic energy of P. The 
'oscillations' that are described here are not true oscillations because they 
only involve shifting between two states, rather than a continuous spectrum 
of states like we have with the x or y component of a spin. However, the ter- 
minology is useful for describing the analogy with the transverse relaxation 
problem. The model that we will use involves randomly occurring elastic 
collisions between the molecules of P and Q, and falls somewhere between 
the Boltzmann equation and the Ornstein-Uhlenbeck equation in terms of 
its physical detail. For completeness we will therefore show how the model 
is related to the Boltzmann and Ornstein-Uhlenbeck equation. 

Section 2 describes the model and its basic properties and shows how it is 
related to the Boltzmann equation and the Ornstein-Uhlenbeck process. Sec- 
tion 3 then describes the analogy between transverse relaxation and thermal 
equilibration in detail. 

2 Model description 

Consider a one-dimensional, spatially homogeneous gas with two compo- 
nents, a small component P and relatively large component Q. The molecules 
in P and Q have masses m p and m q , respectively, with m p > m q . Suppose 
that at time the molecules in P are instantaneously brought out of thermal 
equilibrium by an outside influence (e.g., a laser pulse). Consider an individ- 
ual molecule Pq G P. From time 0, the first particle from Q that Pq collides 
with is denoted by Qi, the second particle by Q2, and so on. We assume 
elastic collisions between the molecules of Q and P, i.e., upon colliding with 
Q n the velocity of P becomes 



V n = cV n -! + X, 



(2.1) 



where X n is the velocity of Q n and 



m p — m, 



m p + m q 



By induction, (2.1) is 



n 




(2.2) 



k=i 
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We will assume that X ± , X 2 , . . . are independent J\f(0, a^) random variables, 
and that Vq ~ A/"(0, <Jq) and is independent of X±, X2, .... These assumptions 
are standard assumptions in gas-gas collision models. We will also ignore 
collisions between molecules in P. 

Definition 1. The stochastic process V = {V n } n( z{o,i,...}, where each V n G 
V is given by (2.2), is called the collision velocity process. V is defined 
with respect to the probability space V = (Q, J 7 , P). 

In the laboratory, equilibration is measured with respect to time, rather 
than the number of collisions that have occurred. Let U\ be the time of 
collision between P and Q 1: U 2 — Ui the length of time between colliding with 
Qi and Q 2 , and so on. We will assume that Lq, U 2 — U\ ■ ■ ■ are independent 
exponential random variables, i.e., for all k, 

P(U k - C4_! < u) = 1 - e" AM , (2.3) 

where A is the average frequency of collisions, and that Lq, U 2 — Lq . . . are 
independent of Xi, X 2 , . . . and V . This means that the number of collisions 
that Pq has experienced by time t is the value of a Poisson process iV at time 
t. This process is also independent of Xi,X 2 , . . . and Vq. Thus, the velocity 
of the particle at time t is simply 

N(t) 

V(t) = c N ^V + c N{t) ~ k X k . (2.4) 
k=i 

Definition 2. The stochastic process V T = {V(t)} teK+ , where each V(t) G 
V T is given by (2.4), is called the time velocity process. V T is also defined 
with respect to V . 

The time parameter of the time-dependent stochastic processes is written 
inside of the parenthesis to distinguish the time velocity process from the 
collision velocity process. 

The advantage of defining the 'collision velocity process' and the 'time 
velocity process' separately is that relatively difficult calculations on the time 
velocity process can instead be performed on the collision velocity process. 
For example, for the collision velocity process we have 

E{V n ) = 0, (2.5) 
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var(K) = c 2 V 2 + o* (j^) ■ (2-6) 

for all n. To prove the second result, note that V n is a weighted sum of 
independent normal random variables, and so 

n n—l 

var(K) = c 2 V 2 + £ c 2 («- fe V 2 = c 2 V 2 + a 2 £ c 2fe . 

fc=l k=0 

Because E(E(V(t) 2 \ N(t))) = E(V^^), we can work out the variance of the 
time velocity process at time t by conditioning arguments. We find that 

var(y(0) = a 2 e- xt ^ + (l - e^ 1 "^) . (2.7) 

A good definition of an 'equilibrium distribution' is also needed to discuss 
the equilibration process. The following is satisfactory. 

Definition 3. The collision velocity process and time velocity process are 
said to have an equilibrium distribution if for all v e M, lim^oo P(V n < v) 
and lim^oo P(V(t) < v) are well-defined probabilities. 

Theorem 2.1. The collision velocity processes and time velocity processes 
each have an equilibrium distribution. 

Proof. Choose a v e R. Each V n G V is a sum of normal random variables 
(2.2) and are therefore normal random variables with mean and variance 
given by (2.5) and (2.6). These two quantities determine the distribution. 
This means that P(V n < v) — > (27r(T 2 <) ) _1 ^ 2 J* u exp(— u 2 /(2a 2 <i ))du, which is a 
well-defined probability. 

We can expand the distribution of any V(t) G V T with the total rule of 
probability, i.e., 

oo 

P (V(t) <v) = J2 P (V(t) < v I N(t) =n)P (N(t) = n) . 

k=0 

From the Poisson distribution (P(N(t) = n) = exp(— Xt)(Xt) n /n\) we can 
show that lim^oo P(N(t) = n) = 1 if n = oo and is zero otherwise. There- 
fore, 
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lim P (V(t) < v) = P(V(t) < v | N(t) = oo) 
= lim P (V n < v) , 

n^oo 

which is well-defined by the previous result. The theorem then follows from 
the fact that v is arbitrary. □ 

The above proof shows that the equilibrium distributions of both the collision 
velocity process and the time velocity process are normal with mean zero and 
variance 0^/(1 — c 2 ). 



2.1 The Boltzmann equation 

To establish the connection of the above model with the Boltzmann equation, 
let us briefly ignore the results in the previous section and consider how a 
physicist would approach the problem. We will deliberately gloss over certain 
mathematical technicalities to stay true to the approach. In its most general 
form, the Boltzmann equation for a spatially homogeneous gas under no 
external forces is 



df(v,t) df(v,t) 



dt dt 



(2.8) 



coll 

The probability density f(v , t) is proportional to the number of molecules in 
P with velocities in (v,v + Sv) at time t, where Sv is a small constant. The 
collision term df(v,t)/dt \ co u is equal to the sum of a loss and a gain term. 
To calculate the loss term, notice that for very small Sv almost all molecules 
in P with velocity v will leave (v, v + 8v) if they experience a collision with 
a molecule from Q. This can be seen directly from equation (1). Therefore, 
the change in the density of molecules in (v, v + 5v) due to such collisions 
during a short time interval of length 8t is 

S fioss = (f(v,t)- PcM (6t)f(v,t)) - f(v,t), (2.9) 

where p C oii(8t) is the probability of a collision during the interval St. The 
first term on the right-hand side of (2.9) is the fraction of molecules in P 
with velocities in (v, v + 8v) at time t that still have velocities in (y, v + 8v) 
after the time period 5t. Expanding p co u(St) to first order in St gives 
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Pcou(5t) = aSt (2.10) 

where a is a constant. To obtain (2.10), p co ii(0) was set to 0. We therefore 
have S fi oss = —aStf(v, t). Dividing through by St and taking the limit Sv — > 
and St — > gives the loss contribution of df{v,t)/dt\ coll : 



df(v,t) 



dt 



= -af(v,t). (2.11) 

loss 



As for the gain term, consider a molecule in P which, at the beginning 
of the time interval St, has a velocity (v — x)/c. According to (2.1), this 
molecule will acquire a velocity in (v,v + Sv) if it collides with a molecule 
from the medium with velocity x. Letting g{u) denote the velocity density of 
the surrounding gas (which is time independent), the change in the velocity 
probability density of the molecules in P at point v due to these collisions is 

/oo 
g(x)f((v - x)/c, t)p coll (St)dx. 
-oo 

Expanding p co ii(St) to first order in St, dividing through by St and taking the 
limit Sv — >■ and St — > gives the gain contribution of Sf(v,t)/St\ coll , 



df(v,t) 



dt 



/oo 
g(x)f((v-x)/c,t)dx. (2.12) 
-oo 



gam 

Substituting (2.11) and (2.12) into (2.8) gives 

= a f{{v _ x y Ci t)g{x)dx _ af{Vj t) , (2.13) 

(2.13) is the Boltzmann equation for our elastic collision model. The 
constant a could be determined by initial conditions, however we will not 
do this here. Let us now consider the time velocity process described in the 
previous section. The time velocity process is a stochastic representation of 
(2.13) in the following sense. 

Theorem 2.2. Let f(v,t) denote the probability density function of the ran- 
dom variable V(t) G V T at point o£l. Then df(v,t)/dt is given by (2.13) 
with a = A. 
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Proof. Conditioning V(t) on {N(t) = n}, we can write 



P(V(t)<v) = P (V(t) < v | N(t) =n)P (N(t) = n) 

?1=0 

oo 

= Y d P{V n <v)P{N{t) = n). 



n=0 



Because V n is a normal random variable, P (V n < v) is a.e differentiable with 
respect to v. Differentiating the above equation term-by-term then gives a 
well-defined expression for the probability density of V(t), namely 

oo 

f(v,t) = Y,fn{v)P(N(t)=n), 

n=0 

where f n {y ) is the probability density of the random variable V n . Differenti- 
ating the above with respect to time gives 

n=0 

By differentiating the Poisson distribution we can show that dP(N(t) = 
n)/dt = XP(N(t) = n - 1) - XP(N(t) = n). Therefore, 



df(v,t) 
dt 



= ^J2 (Uv)P(N(t) = n - 1) - f n (v)P(N(t) = n)) (2.15) 



71=0 



where P(N{t) = —1) is defined to be zero. The term on the far right of 
(2.15) is equal to \f(v,t). The first term can be re-written as 

oo oo 

J2fn(v)P(N(t) = n - 1) = J2fn + i(v)P(N(t) = n). (2.16) 

n=0 n=l 

According to (2.1), the density of V n+ \ is a convolution of the density of V n 
at point (v — x)/c and the density of X n+ \ at point x. That is, 

/oo 
f n ((v-x)/c)g(x)dx. (2.17) 
-oo 
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Substituting (2.17) into (2.16) and carrying out the sum gives 

00 poo 

Y J Uv)P{N(t)=n-l)= / f{{v-x)/c)g{x)dx. (2.18) 

n=0 

Substituting (2.18) into (2.15) gives the result. □ 
2.2 The Ornstein-Uhlenbeck process 

Now we will show that as successive collisions between P and molecules of Q 
occur more and more frequently, the time velocity process converge pointwise 
in probability to the paths of an Ornstein-Uhlenbeck process. The limit that 
describes this is A — > 00. A sequence of process Y 1 ,Y 2 , . . . is said to converge 
to a process Z pointwise in probability if P(\ Y n (t) — Z(t) \> e) — > for all t. 

To prove this, we will construct the time velocity process in a slightly 
different way. Consider the probability space V = (fl, J 7 , P). Let Ai < A2 < 
• • • be a sequence of positive constants such that \ n — > 00, and define the 
family {n n }™ =l such that 

ir n ={0<t T l<t 2 2 <•••}, (2.19) 
where t% — > 00 for each n and 

mesh(7r„) = 1/A n 

for each n. Next, let W = {W(t)} t eR + be a Wiener process (following the 
standard definition, e.g., [25]) on V and define a sequence X™,X%, . . . for 
each n such that 

XZ = a° x AW(q +1 ), 

where AW(tJ} +1 ) = W(q +1 ) - W{tf) and > is a constant. Finally, let 
the V^, V 2 T , ... be a sequence of stochastic processes on V, where for each 

V(tr e V n T , 

N(t) n 
k=l 

where constant and < c n < 1. As with the time velocity process 

defined earlier, Vq ~ jV(0,(Tq) and each Poisson process N n is independent 
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of the sequence X™, XV; , . . . and V . For each n, X™, ... is a sequence of 
independent jV(0, cr°/A n ), and so for each n the time velocity process defined 
here satisfies the criteria of Definition 2. 

We will now make an important addition to the above construction. As- 
sume that there exists a constant a < 1 such that 

c n = a 1/Xn . 

This means that c n — > 1 as A — > oo. This can be interpreted as follows. 
Recall the definition of c for the processes constructed in section 2, 

m„ — m„ 

c = — -. 

m p + m q 

In this case, c — > 1 as m p — > oo. The assumption that c n — > 1 in the present 
construction might be therefore be taken to mean that m p becomes very 
large compared to m q . In other words, the limiting time velocity process 
describes a particle with a very large mass, just as is assumed in the usual 
theories of Brownian motion. We can use this interpretation to understand 
the definition of X£ given above. According to this definition, 

varX™ ->■ 

This means that the range of velocities in the surrounding gas becomes 
very narrow in the limit. This can be understood by imagining ourselves 
riding on P as it travels through the medium. Because Po is very heavy and 
slow compared to the particles of the surroundings, the particles of the sur- 
roundings appear to be moving extremely fast, too fast for us to distinguish 
their speeds. The assumptions used in this construction can be regarded 
as a renormalisation of the time velocity process. We have employed similar 
renormalisations in similar studies of weak convergence to Gaussian processes 
[22, 24]. 

Theorem 2.3. Let Y = {Y(t)} t £R + be the following OU process. For all 

Y(t)eY, 

Y(t) = Y(0)e~ 9t + r] f e~ e ^dW(s), (2.20) 
Jo 

where 9 and rj are positive constants and W is the Wiener process given above. 
Suppose that Y(0) = V a.s. and that the integral J * exp(— 0{t — s))dW{s) is 
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an Ito integral. Then converges pointwise in probability to an OU process 
as n — > oo. 

Proof. Using the assumption c„ = a 1 ^", V(t) n can be rewritten as 

N(t) n 

V(t) n = V a N ^ n / x " + a {mn ~ k)/x "AW(t n k+1 ). 

k=i 

And if we set 

= -lna (2.21) 

we can write (2.20) as 

Y(t) = Voa* + r] [ a'-'dWis). 
Jo 

Let 

A(t) = a N ^ n / x "V - a'Vo 

and 

N(tr ft 

B(t) = o-° x Y] a^ n - k ^ x "AW(tl +1 )- V / c^W(s). 

Now, let Ui, U2 ■ ■ ■ be the jump times of the Poisson process N n , and 
Uf = sup k (UJ? < t). To prove that A(t) and B(t) converge in probability to 
zero, we will first show that for an arbitrary n and k < N(t) n , \ n U k can be 
brought arbitrary close to k (with respect to an appropriate metric d(x, y) 
on the space of random variables topologised by convergence in probability, 
e.g., the Ky Fan metric) with increasing n. For an arbitrary k < N(t) n we 
can use the strong law of large numbers to form the approximation 

k 

Ul = Y,K l ~kE(K 1 ) 

i=i 

where the approximation can be made arbitrary accurate (with respect to 
the metric d) by increasing n. Thus, \ n U'k can be brought arbitrarily close 
to k by increasing n, as claimed. 

To prove that A(t) — > in probability, we make the approximation 
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A(t) « V a u " - V a'. 

The continuous mapping theorem shows that this approximation can be made 
as accurate as desired (with respect to d) by increasing n. Because a > 1, 
we can write 

V a u " - Voa* < V a u " - V a u "+^ 

where Ufa = inf(£/£ > U?). (2.3) shows that Ufa - U? ->• in probability, 
and appealing to the continuous mapping theorem once again shows that 
V a u " — V a l — > in probability. We then have that A(t) — > in probability. 
As for | B(t)\, we can write 



B(t)\ < 



N(ty> 



N(ty 



a o J- a WN)/^ (tW _ a o J- a u ^AW(tl 



k+1, 



k=l 



k=l 



(Tl) 



+ 



N(ty 

oil ^ O 

k=l 



*AW(t n k+l )- v [ a*"W( a ) 
./o 



(T2) 



This is because for any three functions /(£), g(t) and — <?(i)| = 

|(/(t) - h{t)) + - ff (t))| < \f(t) - h(t)\ + \h(t) - g(t)\. Similar to what 
was done previously, we can make the approximation 



N(t) n N(t) n 
k=l k=l 

which improves in accuracy as n — > oo. Then P(T1 > e) — >■ trivially for 
all e > 0. As for the term (T2), define the family {/i n }^Li, where \i n = {0 < 
< < ■ ■ ■ < U? < t}. Each random partition of [0, £]. We will 

say that the family {/i n }^i 1 tends to [0,t] if 



1. Hindoo C/ t n = t in probability. 

2. Hindoo [/" = in probability. 

3. sup fc \ U% — U k _ x \ — > in probability. 
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To check 1, let e > and note that if U™ +1 — U™<e then t — U™ < e. Because 
the converse is not necessarily true, this implies that 

{t-U?<e}D{U? +1 -U?<e}, 
and so by monotonicity, 

Pit- U? < e) > P (U? +1 - U? < e) = 1 - e~ Xnt -> 1. 

Taking the limit e — > confirms condition 1. Condition 2 and 3 are true 
because P(| U? |> e) = e" A " e ^ and P (sup fc |t/£ - U^\ > e) = e A " e ->• 
for all e > 0, according to (2.3). The family n 1 = {71^}^, where n l n = {0 < 
t™ < t% < ■ ■ ■ < t™ +1 < t} C 7r„, also tends to [0, t\. And so for the left-hand 
term in (T2) we have that 

N(t) n N(t) n 

fe=i fc=i 

n 

lim Va'- tfc AW/(t£ 



in probability. The integral in term (T2) is an Ito integral by assumption, 
and we can define it as 



Jo 



00 k=i 



Therefore, it we set 

V = *i (2-22) 
then P(T2 > e) ->■ 0. □ 

The introduction of this report mentioned several models similar to the 
above which converge to the Ornstein-Uhlenbeck process and other simi- 
lar processes. However, Theorem 2.3 has at least two features that make 
it interesting. One reason is that pointwise convergence in probability is a 
particularly strong mode of convergence. Previous research has mainly con- 
sidered convergence in distribution [10, 11, 12, 13, 14, 15]. The other is that 
(2.21) and (2.22) in the proof give simple interpretations of the parameters 9 
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(the friction coefficient) and r\ (the diffusion coefficient). Equation (19) says 
that 



where A and m p are 'very large' parameters. Thus, 6 is related to the average 
collision frequency and a fractional mass difference. This simply means says 
that friction on the Brownian particle arises when a relatively heavy particle 
is ambushed by many lighter particles. This is an alternate interpretation 
to the standard 'heat bath' interpretation of the friction coefficient, in which 
friction is an consequence of a particle simultaneously interacting with many 
other particles (see, for example, [26]), rather than a seqeunce of particles 
like we have here. (2.20) says that the diffusion coefficient is related to the 
root-mean-square deviation of the velocities in the surroundings, but that it 
is independent of other quantities such as the frequency of collisions or the 
mass of the particles. 



3 Oscillations between low and high energy 



We now want to describe the equilibration process in terms of an 'oscillation' 
between low energy and high energy states. To make this concept more 
precise, define 



If F n occurs then P has gained a net amount of kinetic energy after un- 
dergoing n collisions with the surrounding gas. If D n occurs, then the test 
particle has lost a net amount of kinetic energy after undergoing n collisions 
with the surrounding gas. For convenience, D occurs a.s. The appropriate 
analogues for these events in for the time velocity process are 




states 



F n = {v 2 > v 2 } 



and 



D n = {V 2 <V 2 }. 



F(t) = {V{tf > V 2 } 
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and 



D(t) = {V{tf < V 2 } . 

Now, let 

F = {v E R : V 2 - v 2 < 0} , 

and 

D = {v E R : -v 2 > 0} . 

D and F are the 'low energy' and 'high energy' states that we are interested 
in. If D n occurs then V n G D and if F n occurs then V n £ F. Similarly, if 
D(t) occurs then V(t) E D and if F(t) occurs then V(t) G F. Clearly, D(0) 
occurs a.s. 

3.1 Recurrence of D and F 

If D and F are recurrent, then P enters and exits the states F and D 
infinitely often (i.o.) with probability 1 as it travels through Q. In this 
situation, P therefore switches between high and low energy states over 
time. We will refer to this switching as an 'oscillation'. While this is not a 
true oscillation, the terminology is useful in order to create the analogy with 
transverse relaxation in magnetic resonance spectroscopy. The occurrence of 
such oscillations is proven in this section. Recall that for a sequence of events 
Ei, E 2 , . . ., 

oo oo 

{E n i.o.} = p| [J E n 

n=l m=n 

= {w G O that belong to infinitely many of the E n } . (3.1) 
Theorem 3.1. P(D n i.o) = P{F n i.o) = 1 

Proof. The event {D n i.o} is permutable because its occurrence will not be 
affected by finite permutations of the indices of Xi, X 2 , . . . [25]. Suppose that 
P(D n i.o) < 1. Then the Hewitt-Savage 0-1 law implies that P(D n i.o) = 0. 
There therefore exists an m < oo such that P(D n ) = for all n > m. In 
other words, 
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v 2 - vl < 

for all n > m with probability 1. (2.3) shows that 



(3.2) 



C n V + J2 X *= Cn ' mV m + Yn-m 
k=l 

where 

n 

Y — c n ~ k Xu 

1 n—m / l- ^>-A;. 

fc=m+l 

Substituting this into (3.2) gives 

r> n-my y _y -> T/ 2 _ ~ 2 (n-m) t^2 
v m ± n — m i n—m ^ * o <-* v m - 

Taking the expected value and noting that V m and Y n - m are independent 
mean zero random variables, we obtain 

OXTo 2 -^^). 

In the limit (n—m) — > oo, the above becomes > <Jq, which is nonsensical. So 
we conclude that P(D n i.o) = 1. A similar argument gives P(F n i.o) = 1. □ 

Now we need to establish the connection between this result and V T 
entering F and D i.o. A very reasonable definition of this event is 

{V T enters D i.o.) = {V n E D, K-i e F i.o.} , 

and similarly for \V T enters F i.o.}. However, Theorem 3.2 immediately 
implies that {V n G D, V n -\ G F i.o.} occurs with probability I. And so we 
have the following theorem. 

Theorem 3.2. P (V enters F i.o.) = P (V enters D i.o.) = 1 

While all molecules in P start in the D state, they each collide with 
molecules from Q at different times, and therefore the oscillations between 
F and D eventually fall out of phase with one another (Figure 4). There is a 
net destructive interference between the oscillations, and therefore they are 
not seen if we study the average kinetic energy of the molecules of P directly. 
In this sense, the thermal equilibration of a gas and transverse relaxation 
of spins in magnetic resonance spectroscopy can be understood in a similar 
way. 
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3.2 Frequency of oscillations between F and D 



The next two sections will characterise the oscillations by studying their 
frequency and period. We will restrict attention to the expected value of 
these quantities. Define the collision crossing number, 

C n = \{1 < m < n such that {V m ^ eD,V m e F) or (V m -i G F, V m G D)}\ . 

C n is the number of collisions out of n collisions that cause P to cross the 
boundary between F or D. The random variable 

W n = C n /n 

measures the frequency at which P enters and exits F as a function of the 
number of collisions. W n is called the collision crossing frequency. 

Theorem 3.3. E(Wi), E(W2), . . . is a strictly and monotoncially increasing 
sequence and E(W n ) — > a, where a < 1 is a positive constant. 

Proof. The second part of the theorem follows from the first via the com- 
pleteness axiom and the fact that W n < 1 a.s. To prove the first part, define 
the family A\, A2, ■ ■ •, where 

A n = {0, 1/n, 2/n, . . . , 1}. 

A n is the state space of W n . Let a™ denote be ith element of A n (i.e., 
a£ = (k — l)/n for k — 1, 2, . . . , n + 1) and 6" = na™. Now, fix an arbitrary 
n and arbitrary a™ G A n . Let 

^n+i = {K+i eFnv n eD}u {V n+1 e Dnv n e F} . 

If i^n+i occurs then the collision velocity process crosses the boundary be- 
tween F and D at the (n + l)th collision. Using this event, the event 
{C n+ \ > bf} can be decomposed as follows: 

{c n+1 > bn = {c n > b r i) u ({c n = bn n H n+1 ) . 

Because {C n > 6™} and ({C n = 6™} fl H n+ i) are mutually exclusive, we can 
then write 

P (C n+1 > 6?) = P (C n > 6?) + P ({C n = bn n ffn+i) . (3.3) 
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To form a strict inequality from this equation, we will show that P({C n = 
bf} r\H n+ i) > 0. Without loss of generality, suppose that 6™ is odd. Because 
Vo G D a.s, this means that if {C n = 6™} then V n G F with probability 1. 
And so 

P {{C n = b?} n H n+1 ) = P {{C n = V}} n {K+i e £>}) • 

Let us suppose that P({C n = &"} fl {K+i € -D}) = 0. According to de 
Morgan's law, this means that 

P({C n = bn c u{V n+1 eF}) = l, 

where A c is the compliment of event A. For any two events Ai,A 2 G f2, we 
have the elementary relationship P(A 1 U A 2 ) = P{Ai) + P(A 2 ) - P(A 2 H Ai). 
If we set P(Ai) = ei and P(A 2 fl A\) = e 2 , then this relationship can be 
written as P{A 2 ) = P(Ai U A 2 ) — (ei — 62)- Applying this to the above, with 
A 1 = {C n = b^} c and A 2 = {V n+1 E D}, we have that 

P(V r n+1 e£>) = l-(e 1 -e 2 ). 
According to this and (2.1), with probability 1 — (ei — e 2 ) the inequality 

V 2 -(cV n + X n+1 ) 2 <0. 

is satisfied. However, X n+1 is independent of V n and V and so this inequality 
will not hold with probability 1 — (ei— e 2 ) for general e\ and e 2 . So we conclude 
that P({C„ = 67} n fZru-i) > 0. From (3.3) we then obtain 

p(c n >K)<p(c n+1 >K). 

In other words, 

P (W n >a^)<P (W n+1 > ay/(n + 1)) . 

To complete the proof, note that a™^ 1 < a™n/(n + 1). This means that 
P{W n+ i > a?n/(n + 1)) < P(W n+ i > a^). Thus, summing both sides of 
the above over A n and using the fact that the cardinality of A n+ i is greater 
than the cardinality of A n , we find that 
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E(W n ) < P > a >/( n + X )) 

< £ P {W n+1 > atl) • 

a"+ 1 e.4 n +i 

The sum on the right hand side goes over all elements of A n +i expect for 1. 
However, W n+ i < 1 a.s. Therefore, 

P{W n+1 >atl)= P{W n+1 ><+ 1 ), 

n+1 a n + 1 yi 

and hence we obtain £'(W n ) < E(W n+ i). Applying this result to the cases 
n — 1, 2, ... in order, we can complete the proof by induction. □ 

This result can also be extended to the time velocity process in a straight- 
forward way. Define the time crossing frequency at time t, 

W{t) = C N{t) /N(t) 

W(t) is the fraction of collisions that have caused the particle to enter or exit 
the state F (or D) out of the total number of collisions that have occurred 
by time t. 

Theorem 3.4. Let t 1? t 2? --- be any sequence of times such that < ti < 
ti < • • • . Then E(W(ti)), EiWfa)), . . . is a strictly monotonically increasing 
sequence and E(W(t n )) — > a, where a < 1 is a positive constant. 

Proof. Under the condition {N(t) = n}, W(t) = W n a.s.. According to 
Theorem 3.3 and the fact that N(s) < N(t) for all s < t a.s., 

E(W(t) | N(t)) > E(W(s) | N(s)) 

a.s. Now, if / is a strictly and monotonically increasing function, then 
E(f(N(t))) > E(f(N(s))). Applying this to the above result gives E(E(W(t) 
N(t))) > E(E(W(s) | N(s))). Equivalents, we have that E(W(t)) > 
E(W(s)) for all s < t. 

To prove the second part, note that for every t E(W(t) | N(t)) > and 
that E(W(t) | N(t)) — > a < 1 according to Theorem 3.3. Using the mono- 
tone convergence theorem, we therefore have that E(E(W(t) \ N(t))) — > a, 
or that E{W{t)) ->■ a. □ 
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Theorem 3.5 says that on average the molecules of P oscillate more and 
more quickly between the high- and low kinetic energy states as P approaches 
equilibrium. We can understand this with the following analogy. Imagine 
that we have a hot piece of metal that we want to cool to a particular 
temperature. So we dip it into a bucket of cold water for a length of time 
Ti to cool it down. However, then we find that the metal is now too cold, 
so we heat it up with a flame for a shorter time period T2 to warm it up. 
However, now the metal is too hot, so then we put it back into the bucket 
of water, and so on. Each step adjusts the temperature of the metal, but 
always ends up undershooting or overshooting the target temperature. As 
we get closer to the target temperature, the length of time that we need to 
heat or cool the metal for becomes shorter and shorter. The oscillations of 
the molecules in P between high and low energy states can be understood 
in the same way. Initially the molecule has too much kinetic energy, so 
the surroundings work to remove energy from it by collisions. Eventually, 
the molecule has too little kinetic energy, and so the surroundings work to 
provide it with more kinetic energy. However, the molecule then ends up 
acquiring too much kinetic energy, and so on. The analogy between thermal 
equilibration and transverse relaxation ends with Theorem 3.5, because in 
the case of transverse relaxation the average oscillation frequency is time 
independent [16]. 

3.3 Estimate of the first hitting time to state F 

Computing the average period of the oscillations appears does not appear 
to be possible in general, particularly because the average periods become 
shorter with each successive oscillation (Theorem 3.5). We will instead esti- 
mate the average length of the first period and use this as a 'reference value' 
to gauge the magitude to the other periods. The average length of the first 
period is the expected value of the first hitting time to state F. Let 

n = inf {t G R+ : V(t) G F) . 

and 

JVi = min (n : V n G F) . 
Putting these together, we have 
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Ni 
i=l 

where Ki, . . . , are a sequence of independent exponential random vari- 
ables with expectation E(K) = 1/A. 

Theorem 3.5. For N\ and t\ defined above, E(N 1 ) < oo and E(t±) = 

Proof. The second part of the theorem follows from the first part and Wald's 
equation [25]. To prove the first part, it is sufficient to show that 

lim P (jVi > to) = 0. 

m— 5>oo 

Now, because D occurs a.s., 

m m 

{ Nl >m}=f]D k =f]F c k , 

k=l k=l 

and so, by de Morgan's law, 



m 



{N 1 >m}=l\J F k 
Because (Jfcli ^fc — (XLi Ufcln ^fc> we can f° rm the inequality 

{TO JTt 
n U Ffc 
n=l k=n 

And so, by monotonicity 

((mm 

Theorem 3.2 then implies that 

lim P (JVi > to) < P ({F n i.o} c ) = 0. 



□ 
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Theorem 3.6. For T\ defined as above, 



cr, 




Proof. Let 



k=l 



so that we can re-write as 

^ =^0 + ^. (3.5) 
By the definition of Ni, > Vq a.s. Alternatively, we can write 

c 2N W 2 + 2 S 2 Ni +2c N ^S Nl >V 2 . 

Using the fact that c 2Nl < c 2 a.s. and that Ni and V are independent, 
taking the expected value of the above gives 

c 2 cr 2 + 2 E(S 2 Nl ) + 2E(^S Nl ) > cr 2 . (3.6) 

We need to deal with the expected values on the left hand side. First consider 
E(c Nl SjvJ. For almost all u> G f2, either uj e {Vjvi = V } or w G {V^ = 
— Vq}. These events are mutually exclusive, and so we can write 



E(S Nl ) = E(S Nl | V Nl -V = -2V )P(V Nl -V = -2V ) 
+ E(S Nl | V Nl -V = 0)P(V Nl - Vb = 0). 

Now, from (3.5) we can form the random variable 

V Nl -V = S Nl -(l-c Nl )V 

Taking the expected value of this under the condition {V^ 1 — V = — 2V } 
gives (because E(V ) = 0), 

E(S Nl | V Nl -V = -2V ) = 0. 
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Similarly, evaluating the expected value under the condition {V^ — V — 
-2V } gives E(S Nl | V Nl - V = 0) = 0. So we have that E(S Nl ) = 0. 
Therefore, E(c Nl S Nl ) = E(E(c Nl S Nl | NJ) = E(c Nl E(S Nl \ N ± )) = 0. 

Now we will prove that E(S%) < E(Ni)a 2 . Following the method on pg. 
187 of reference [25], we can write 

S 2 NlAn = c 2 ^ lA(n _ 1} + (2cX n S n - 1 + Xl) 1(N, > n), 

Because Ni is a stopping time, l(A^i > n) — l(Ni > n — 1) depends at most 
upon Xi, . . . ,X n _i. Moreover, because S n -i and X n are independent, 

E (S 2 NlAn ) = c 2 E (^ lA(n _ 1} ) + a 2 P{Ni > n). 

By induction, 

a 2 x P(Ni > n) + c 2 a 2 x P{Ni > n - 1) + . . . + c 2n a 2 x P(N > 0) 

^£p(Wi>*). 
fe=i 

Taking the limit leads to EffiJ < a 2 E{Ni). 

Substituting these results into (3.6) gives and using the second part of 
Theorem 3.6 completes the proof. □ 

The above proof shows that the tightness of (3.4) is measured by how 
close c is to 1. As discussed in section 2.3, this condition is satisfied when 
the particles of P are very heavy, such as occurs in the Brownian motion. In 
fact, taking the limit c — > 1 gives 

E( n ) = o, 

in other words, a Brownian particle will almost immediately shift into the 
F state upon being displaced from equilibrium. According to Theorem 3.4, 
the Brownian particle will oscillate extremely rapidly between the F and 
D states. Across the ensemble P of Brownian particles, these oscillations 
will occur slightly out of phase with one another, and so the average kinetic 
energy of the particles in P will quickly decay to zero. 

Let us now consider the other extreme where m p = m q . The estimate in 
(3.6) simplifies to E(ti) > a^/Xa^. Moreover, elementary kinetic theory tells 



E (<SjViAn) 



< 
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us that the (Jq/cI = T p /T q , where T p is the initial temperature of P and T q 
the temperature of Q, respectively. And so we have the inequality 



The parameter A will have a dependence upon T p and T q as well, and so 
the relationship between frequency and temperature is not as simple as the 
above suggests. If T p is much greater than T q , then the molecules of P will 
initially be traveling much faster than those of Q and so the molecules of Q 
will appear stationary from the point-of-view of a molecule in P. Elementary 
kinetic theory then says that A oc l/T p , and so 



where a > is a constant independent of T p and T q . This lower bound is very 
large, and so displacing P far from equilibrium will generate oscillations with 
a small oscillation frequency. The experimental prediction of this work is that 
if P contains very few molecules (so that destructive interference between the 
individual oscillations is minimised), then the individual oscillations between 
F and D may be in phase long enough to have a noticible affect on the 
data. In particular, we would expect that thermal equilibration would take a 
longer length of time with a smaller ensemble size for P. This is analogous to 
how ensembles of a very small number of spins stay in phase with each other 
over a relatively long time period during transverse relaxation and phase 
decoherence measurements [27, 28]. The effect of ensemble size on the rate 
of thermal equilibration might be an interesting topic for a future study. 

4 Final remarks 

Through a series of mathematical arguments we have shown that thermal 
equilibration of a high temperature component of a two-component gas can 
be interpreted in a similar way to transverse relaxation in magnetic reso- 
nance spectroscopy. We did this by dividing the kinetic energy state space of 
the molecules of the high temperature component into a 'high energy' and a 
'low energy' component, and by then showing that the molecules shift ('oscil- 
late') between the two components forever over time. Because each molecule 
in the gas enters and exits these states at a different time, the 'oscillations' 
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occur out of phase and interfere destructively with one another, and so are 
not seen when monitoring the equilibration process via the average kinetic 
energy of the gas in the laboratory. We also suggested that the oscillations 
may stay in-phase long enough to have an observable effect on the thermal 
equilibration process if the high temperature component contained very few 
molecules. We also characterised the oscillation by studying its frequency, 
and showed that the frequency strictly increases with time. This is where 
transverse relaxation and thermal equilibration differ, because in the former 
case the average oscillation frequencies are usually constant in time [16]. The 
stochastic model that we used is not standard in the statistical physics liter- 
ature, however we demonstrated that it is consistent with both the two key 
models of thermal equilibration (Boltzmann equation and Brownian motion) 
and so there is no reason to doubt that the model is physically meaningful. 

The transverse relaxation-like 'oscillatory' interpretation of the thermal 
equilibration presented here offers a new and intuitive way of looking at the 
thermal equilibration process. It may be a useful starting point for creat- 
ing models of the thermal equilibration process, and also suggests that there 
might be other connections between thermal equilibration and other seem- 
ingly disparate non-equilibrium phenomena in statistical physics. 
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Figures 




Figure 1: A simplified diagram of transverse relaxation in a magnetic res- 
onance experiment. The diagrams show four spin particles contained in an 
environment E, drawn in the xy-pl&ne. The spins are a long distance apart 
and so do not interact with each other, however they do interact with the 
degrees of freedom of E. The spin of the particles is represented by the parti- 
cles rotating about their spin angular momentum vectors Si (this is indicated 
by the dotted arrows about each particle). At time zero an electromagnetic 
pulse is applied to the xy plane. This places the spin angular momentum 
vectors in the x?/-plane and aligns them in the same direction. The vectors 
rotate clockwise in plane, initially in unison (left hand diagram; this rotation 
represented by the solid curved arrows). However, random interactions be- 
tween each spin and degrees of freedom of E modulate the angular frequency 
of each rotation. These interactions differ from spin to spin, and so eventu- 
ally the the rotating vectors fall out of alignment with each other and end 
up pointing in a variety of directions (right hand diagram). 
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Time 



Figure 2: A simulation of y component of the spin vector during a magnetic 
resonance experiment on two spins. One spin is represented by the black 
line and the other by the red line. The direction of the spin vector in the 
xy plane at various points along the curves is indicated by the arrows. The 
curves are initially coincident, however they fall out of step with one another 
as random modulations of the rotational frequencies take place. The curves 
were simulated with the Kubo oscillator model with Wiener process frequency 
modulation and a mean frequency of 20 (arbitrary units). See [16]. 
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Time 

Figure 3: Thermal equilibration of a two dimensional two component gas. 
The dark grey particles represent the Q component and the light grey parti- 
cles represent the P component. The kinetic energy space of the particles of 
P is divided into two regions, D and F. At time the particles of P are in- 
stantaneously heated by an external source and are all placed in the D state 
(left hand figure). However, random collisions with the particles of Q cause 
the particles of P to shift between the D and F states. Because the particles 
in P each collide with the particles of Q at different times, the particles in P 
end up in a variety of different states at equilibrium (right hand diagram). 
The states D and F will be defined in Section 3 of the main paper. 
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Figure 4: Sketch of the kinetic energy state of two molecules from P as a 
function of time. The two molecules start out in the same kinetic energy 
state. However, because the two molecules collide with molecules from P at 
different times, the oscillations between the two states falls out of step with 
each other over time. This switching between the F and D states is referred 
to as an 'oscillation' in this work. 
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